DETAILED ACTION
Claim Rejections - 35 USC § 101

1. 35 U.S.C. 101 reads as follows:

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or
any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and
requirements of this title.

2. Claims 26, 39 are rejected under 35 U.S.C. 101 because the claimed invention is directed
to non-statutory subject matter. As per the most recent interpretation of the Interim Guidelines
regarding 35 U.S.C. 101, claim 26 defines a non-statutory process because the claim is directed toward
computer code, and as claimed, does not define any structural and functional interrelationship
between the computer program and other claimed elements of a computer which permit the
computer program's functionality to be realized (Warmerdam, 33 F.3d at 1361, 31 USPQ2d at
1760; Lowry, 32 F.3d at 1583-84, 32 USPQ2d at 1035).

Claim Rejections - 35 USC § 112

3. The following is a quotation of the second paragraph of 35 U.S.C. 112:

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

4. Claims 1, 3-48 are rejected under 35 U.S.C. 112, second paragraph, as being indefinite for
failing to particularly point out and distinctly claim the subject matter which applicant regards as
the invention. As per independent claims 1, 19, 22, 27, 31, 32, the claim recitations pertaining to
obtaining/segmenting audio signals based upon audio characteristics are vague and indefinite
because it is not clear as to which segmenting aspect of applicant's disclosure this refers. For
example, applicant's disclosure presents two segmenting sections, the first being 1) a typical
audio encoder that extracts audio signal information (outputting segments based upon
voiced/unvoiced, silence decision - fig. 4, line 110 into subblock 12, generating segmented audio
with associated parameters - line 112; and applicant's specification, page 13 lines 8-14); and the
second being 2) the re-segmented sequence of initial segments based upon a degree of voicing,
etc., derived from speech parameters (fig. 4, subblock 20, and applicant's spec, page 15, lines 1-17).
The current claim scope does not distinguish between these two sections of applicant's
disclosure and as such, these claims are rejected under 35 U.S.C. 112, second paragraph. For art
related examination purposes only, examiner will interpret the claim scope to read upon the first
section discussed above, namely, the encoder section of Fig. 4 that encompasses only line 110,
subblock 12, and line 112. The dependent claims do not remedy the deficiencies of the
independent claims, and as such, are also rejected under 35 U.S.C. 112, second paragraph.
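
The difference between the two segmenting sections discussed above can be pictured with a minimal sketch. It assumes the first section has already produced one label per fixed-length frame, and it models the second section as a merge of consecutive, equally labelled frames into variable-length segments. The function name `resegment`, the label strings, and the 160-sample frame length are hypothetical choices made for illustration; the sketch is not taken from applicant's specification or from any cited reference.

```python
from itertools import groupby

def resegment(frame_labels, frame_len=160):
    """Merge runs of equally labelled frames into (label, start, end) segments.

    frame_labels: one label per fixed-length frame (output of a first stage).
    frame_len:    hypothetical frame length in samples (an assumption).
    """
    segments = []
    start = 0
    for label, run in groupby(frame_labels):
        length = len(list(run)) * frame_len  # samples covered by this run
        segments.append((label, start, start + length))
        start += length
    return segments

labels = ["voiced", "voiced", "unvoiced", "silence", "silence", "silence"]
print(resegment(labels))
```

In this toy model the first section yields six equal frames and the second yields three segments of unequal length, which is the kind of distinction the rejection asks the claims to make explicit.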

5. Claims 26, 39 are rejected under 35 U.S.C. 112, second paragraph, as being incomplete
for omitting essential structural cooperative relationships of elements, such omission amounting
to a gap between the necessary structural connections. See MPEP § 2172.01. The omitted
structural cooperative relationships are: a relationship between the computer software product
and the encoder - how is the computer software product code performing the method steps of
claim 1? (i.e. the relationship between the computer codes being accessed as computer
executable instructions, and these computer executable instructions causing a processor to
perform the method steps, is missing).

Claim Rejections - 35 USC § 102 

6. Claims 1, 3-14, 19-21, 26-37, 39-44, 46-48 are rejected under 35 U.S.C. 102(b) as being
anticipated by Gersho et al. (6,311,154).

As to claim 1, Gersho et al. teach segmenting {partitioning or classifying} the audio
signal {speech} into a plurality of segments {frames} (partitioning samples of a speech signal
into frames, col. 4, lines 25-27) based on the audio characteristics {classes} of the audio signal
(classifying the speech signal in each frame into one of a plurality of classes, col. 4, lines 25-27);
and encoding the segments {frames} with different encoding settings {excitation} (encoding an
excitation for the frame using one of a plurality of excitation coding... selected according to the
class of the frame, col. 4, lines 30-33).
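
The three operations mapped above (partitioning into frames, classifying each frame, and selecting an encoding according to the class) can be sketched as follows. This is only a toy illustration: the frame length, the energy and zero-crossing thresholds, the class names, and the per-class bit budgets are all hypothetical values, and the classification rule is a generic textbook heuristic rather than the method of Gersho et al. or of applicant.

```python
import math

FRAME_LEN = 160  # hypothetical frame length in samples (assumption)

# Hypothetical per-class encoding settings (illustrative values only).
SETTINGS = {
    "silence": {"bits": 8},
    "unvoiced": {"bits": 32},
    "voiced": {"bits": 64},
}

def partition(samples, frame_len=FRAME_LEN):
    """Split the sample sequence into consecutive fixed-length frames."""
    return [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]

def classify(frame, silence_energy=1e-4, zcr_threshold=0.25):
    """Label a frame from its mean energy and zero-crossing rate (toy rule)."""
    energy = sum(x * x for x in frame) / len(frame)
    if energy < silence_energy:
        return "silence"
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    zcr = crossings / max(len(frame) - 1, 1)
    return "unvoiced" if zcr > zcr_threshold else "voiced"

def encode(samples):
    """Return (class, settings) per frame; the actual coding step is omitted."""
    return [(c, SETTINGS[c]) for c in map(classify, partition(samples))]

# Synthetic input: a silent frame, a low-frequency tone, a rapidly alternating frame.
silent = [0.0] * FRAME_LEN
tone = [math.sin(2 * math.pi * 100 * n / 8000) for n in range(FRAME_LEN)]
noisy = [0.5 if n % 2 == 0 else -0.5 for n in range(FRAME_LEN)]
print([c for c, _ in encode(silent + tone + noisy)])
```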

As to claim 3, Gersho et al. teach characteristics {classes/classifying} include voicing
characteristics {voice} in said segments {frames} of the audio signal {speech signal}
(classifying the speech signal in each frame into classes, classes include voice frame, col. 4, lines
25-27 & 35).

As to claim 4, Gersho et al. teach characteristics {identifying} include energy
characteristics {presence of energy} in said segments {window} of the audio signal {residual
signal} (identifying the location of a window, identifying considers the presence of energy in the
residual signal, col. 4, lines 65-67).

As to claim 5, Gersho et al. teach characteristics {positioning} include pitch
characteristics {function of the pitch} in said segments {frames} of the audio signal (positioning
the window at a location that is a function of a pitch of the frame, col. 4, lines 59-61).

As to claim 6, Gersho et al. teach segmenting {partitioning} is carried out concurrently 
{classifying and encoding} to said encoding {coding} (partitioning samples of speech, 
classifying speech signals into classes, coding a speech signal, col. 4, lines 24-25. The 
classifying and encoding process may be done concurrently). 

As to claim 7, Gersho et al. teach segmenting is carried out before said encoding 
(partitioning samples of speech, classifying speech signals into classes, coding a speech signal, 
col. 4, lines 24-25, thus the classifying or segmenting is done before coding). 

As to claim 8, Gersho et al. teach a plurality of voicing values {voice or unvoiced} are
assigned to the voicing characteristics of the audio signal in said segments, and wherein said
segmenting {partitioning} is carried out based on the assigned voicing values (classifying a
frame as being one of an unvoiced or voiced, col. 4, lines 52-53).

As to claim 9, Gersho et al. teach a value designated {classifying} to a voiced speech
signal and another value designated to an unvoiced signal (classifying a frame as being one of an
unvoiced or voiced, col. 4, lines 51-52).

As to claim 10, Gersho et al. teach a value designated {classifier} to a transitional stage
between the voice and unvoiced {transitional} signals {frame} (classifier for classifying a
transition frame, col. 4, lines 52-55).

As to claim 11, Gersho et al. teach a value designated {(m)=1} to an inactive period
{silent frame} in the audio signal {speech} (If (m)=1, then the current frame is declared a silent
frame, col. 15, lines 7-8 & 35-37).

As to claim 12, Gersho et al. teach selecting a quantization mode for said encoding in
order to improve the bit allocation and to reduce the parameter update rate, wherein the
segmenting is carried out based on the selected quantization mode (col. 3 lines 45-49; Fig. 5 and
col. 11 lines 4-16; col. 4, lines 36-37, col. 15, lines 35-36 & col. 9, lines 63-65).

As to claim 13, Gersho et al. teach segmenting is carried out based on target accuracy in
reconstruction of the audio signal, wherein the target accuracy is selected based on distortion
criteria comparing up-sampled quantized values (transmitted samples) and modified parameters
(col. 9, lines 63-65 and col. 3 lines 45-49).

As to claim 14, Gersho et al. teach segmenting is carried out for providing a linear pitch
representation in at least some of said segments (col. 9, lines 63-65; col. 3 lines 45-49 and col. 4
lines 50-62).

As to claims 19 and 27, Gersho et al. (154) teach an input for receiving audio data
indicative of the parameters in the adjusted representation (input applied to element 14, Fig. 3),
and a module responsive to the audio data for generating the audio signal based on the adjusted
signals and the characteristics of the audio signal (Fig. 3. One would necessarily need a module
to respond to an adjusted audio signal/characteristics of audio signals).

At the time of the invention, it would have been inherent to one of ordinary skill in the art to use
a decoder in order to reverse the encoding data for further processing, such as modulating or
storing the audio signal.

As to claims 20 and 28, Gersho et al. (154) teach recording parameters (col. 29 lines 25-35).

As to claims 21 and 29, Gersho et al. (154) teach that
the audio data is transmitted through a communication channel and wherein the input of the
decoder is operatively connected to the communication channel for receiving the audio data
(digital communications, col. 1, line 1 and Fig. 3).

As to claim 26, Gersho et al. (154) teach a code for determining the characteristics of the
audio signal (LP coding, col. 8 lines 54-); a code for adjusting the parameter based on the
characteristics of the audio signal for providing an adjusted representation of the parameter,
wherein said adjusting comprises the steps of segmenting the audio signal into a plurality of
segments based on the characteristics of the audio signal and encoding the segments based on
one or more of a plurality of encoding settings (LP coding, modified residual, adjusts frames,
Abstract and Fig. 9; col. 8 lines 54-63).

As to claim 30, Gersho et al. (154) teaches a mobile terminal (mobile base station, col. 6, 
lines 17-18). 

As to claim 31, Gersho et al. (154) teaches implementing in a cell phone system which
necessarily has both base station and mobile station adapted to communicating with the base
stations (col. 6, lines 33-36); a decoder for use in parametric audio coding for generating a
synthesized audio signal indicative of an audio signal having audio characteristics, wherein the
audio signal is coded in a coding step into a plurality of parameters at a data rate and the
encoding step is adjusted based on the characteristics of the audio signal for providing an
adjusted representation of the parameters, wherein the said adjusting comprises the steps of
segmenting the audio signal into a plurality of segments based on the characteristics of the audio
signal and encoding the segments based on one or more of a plurality of encoding settings (Figs
1, 4-5, LP coding, modified residual, adjusts frames, Abstract and Fig. 9; col. 8 lines 54-63);
an input for receiving audio data indicative of the parameters in the adjusted representation from
at least one of the base stations for providing the audio data to the decoder, so as to allow the
decoder to generate the synthesized audio signal based on the adjusted representation (Figs 1, 4-5,
col. 3 lines 1-15).

As to claim 32, Gersho et al. (154) teach an input for receiving audio data indicative of
end points defining a plurality of sub-segments, wherein the audio signal is encoded for
providing parameters indicative of the audio signal, the parameters including pitch contour data
containing a plurality of pitch values representative of an audio segment in time, and wherein the
pitch contour data in the audio segment in time is approximated by a plurality of consecutive
sub-segments in the audio segment, and wherein the end points include a first end point and a
second end point for defining each of said sub-segments (decoder, col. 6 lines 8-11 and Fig. 1);
and a reconstruction module for reconstructing the audio segment based on the received audio
data (Fig. 9; col. 6 lines 8-11).

As to claim 33, Gersho et al. (154) teach encoding settings inherently include bit
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 11 lines 4-16), quantization
method (col. 11 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60).

As to claim 34, Gersho et al. (154) teach the audio signal contains sinusoidal
components (col. 3 lines 25-29, analysis windows made equal becomes sine) and said parameters
include frequency values (Fig. 1 element 68), amplitude values (col. 3 lines 51-55) and phase
values indicative of the sinusoidal components (Fig. 1 element 76 and col. 3 lines 25-29).

As to claim 35, Gersho et al. (154) teach the parameters include pitch (col. 4 line 60),
voicing (Fig. 9 element 42c), amplitude (col. 3 lines 51-55) and energy of the audio signal (col.
3 lines 42-44).

As to claim 36, Gersho et al. (154) teach the parameters include pitch contour data (col. 4
lines 60-61) containing a plurality of pitch values inherently representative of an audio segment in
time (col. 4 lines 59-63 and col. 2 lines 51-64).

As to claim 37, Gersho et al. (154) teach encoding settings inherently include bit
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 11 lines 4-16), quantization
method (col. 11 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60, Fig. 4, 8-9
and 14).

As to claim 40, Gersho et al. (154) teach encoding settings inherently include bit
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 11 lines 4-16), quantization
method (col. 11 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60, col. 6 lines
8-11).

As to claim 41, Gersho et al. (154) teach wherein the audio signal comprises a plurality of
frames and the audio signal in each frame has a waveform and wherein the further audio signal is
produced in the decoding stage independently of the waveform (col. 14 lines 8-14; col. 13 lines
62-67 and col. 14 lines 1-7).

As to claim 42, which depends on claim 1, Gersho et al. (154) teach 
wherein each segment has a segment length and wherein the segment length of at least one 
segment is different from the segment length of at least one other segment (col. 14 lines 8-14; 
col. 13 lines 62-67 and col. 14 lines 1-7). 

As to claim 43, which depends on claim 19, Gersho et al. (154) teach wherein the audio
signal comprises a plurality of frames and the audio signal in each frame has a waveform and
wherein the module generates the further audio signal independently of the waveform (col. 14
lines 8-14, col. 13 lines 62-67 and col. 14 lines 1-7).

As to claim 44, which depends on claim 19, Gersho et al. (154) teach
wherein the segments comprise segments of different segment lengths (col. 14 lines 8-14).

As to claim 46, which depends on claim 26, Gersho et al. (154) teach
wherein the segments comprise segments of different segment lengths (col. 14 lines 8-14).

As to claim 47, which depends on claim 31, Gersho et al. (154) teach
wherein the segments comprise segments of different segment lengths (col. 14 lines 8-14).

As to claim 48, which depends on claim 32, Gersho et al. (154) teach
wherein the segments comprise segments of different segment lengths (col. 14 lines 8-14).

7. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United
States and was published under Article 21(2) of such treaty in the English language.

8. Claims 15-18, 22-25, 38, 45 are rejected under 35 U.S.C. 102(e) as being anticipated by
Sinha et al. (7191136).

As per claims 15, 22, 23, 45, Sinha et al. (7191136) teaches a method for use in a
parametric audio coding to encode an audio signal by segmenting the audio signal into a
plurality of segments based on audio characteristics of the audio signal (by high pass
filtering the input audio signal (col. 4 lines 47-51), and then performing a non-linear
parametric representation of the signal - col. 4 lines 53-59; wherein the data amount per
processing depends upon the frequency characteristics of the audio signal, and the
characteristics analyzed can be peak analysis, lattice quantization, or frequency range
selection - col. 3 lines 1-6); encoding the segments with different encoding settings (by
choosing compression settings on-the-fly - col. 6 lines 43-47); also teaching upsampling
(col. 7 lines 42-44) or downsampling (col. 7 lines 39-46).

As per claims 16-18, 38, Sinha et al. (7191136) teaches quantized and unquantized
features (col. 3 lines 1-6).

As per claim 24, Sinha et al. (7191136) teaches storage mediums (col. 8 lines 1-9).

As per claim 25, Sinha et al. (7191136) teaches header information transmitted
over communication channels (col. 6 line 64 - col. 7 line 7).

Response to Arguments 

9. Applicant's arguments filed 12/3/07 have been fully considered but are unpersuasive.
As per the arguments against the Gersho reference, examiner argues that the current claim scope
still reads upon the Gersho reference, as detailed in the 35 U.S.C. 112 rejection above. The
amendment filed 12/3/07 has canceled the structural relationships that would have defined
applicant's disclosed invention, namely, the combination of subblocks 12, 20, and signal flowpaths
110 and 112. However, the broad current claim scope still reads upon Fig. 4, subblock 12, and
the disclosed encoder/extraction process detailed in applicant's specification with respect to this
aspect of applicant's disclosure. Furthermore, examiner notes that the process in applicant's
disclosure depends upon an initial partitioning of the audio signal - the parameters that are
extracted from subblock contain not only voicing/energy/spectral characteristics, but timing
information as well, such timing information being used in subblock 20 to determine the final
sectioning of the audio signal. If this timing-partitioning information were not available to
subblock 20, then the decision making process of subblock 20 would be ineffective. Lastly,
examiner wishes to point out to applicant the following prior art - Teguchi (4701955), teaching a
variable frame length vocoder - determining the frame length based on spectral parameters of the
audio signal (col. 4 lines 29-60; col. 3 lines 40-60); Manjunath (6434519), Fig. 5, performing
energy parameter analysis on received samples, and performing different encoding methods
based on different encoding rates (different encoding rate with different size frames); and Stachurski
et al. (7039581), applying different mode encoding based upon extracted parameters (these
references can be applied to different aspects of applicant's disclosure).



10. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. Please see related art listed on the PTO-892 form. 



11. Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Michael Opsasnick, telephone number (571)272-7623, 
who is available Tuesday-Thursday, 9am-4pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Mr. Richemond Dorvil, can be reached at (571)272-7602. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



Conclusion 



MICHAEL OPSASNICK
EXAMINER

02/12/08




